From Pixels to Graphs: Open-Vocabulary Scene Graph Generation with Vision-Language Models

Posted 2025-03-13, updated 2026-03-30

http://chen-yulin.github.io/2025/03/13/[OBS]Reconstruct Anything-Relation-From Pixels to Graphs= Open-Vocabulary Scene Graph Generation with Vision-Language Models/

Author

Chen Yulin

Posted on

2025-03-13

Updated on

2026-03-30

#Scene-graph Visual-Relation Research-paper Multi-modal VLM CV Open-Vocabulary
